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Abstract 

' Consider a linear regression model with n-dimensional response vector, p- dimensional 

regression parameter (3 and independent normally distributed errors. Suppose that 

(N ■ 

the parameter of interest is 9 = a T (3 where a is a specified vector. Define the 
o |' _ 

s- dimensional parameter vector r = C (3 — t where C and t are specified. Also 

i/-) ■ suppose that we have uncertain prior information that r = 0. Part of our evalu- 

ation of a frequentist confidence interval for 9 is the ratio (expected length of this 
f-H ■ confidence interval) /(expected length of standard 1 — a confidence interval), which 

we call the scaled expected length of this interval. We say that a 1 — a confidence in- 
terval for 9 utilizes this uncertain prior information if (a) the scaled expected length 
of this interval is significantly less than 1 when r = 0, (b) the maximum value of 
^ ■ the scaled expected length is not too large and (c) this confidence interval reverts to 

\ the standard 1 — a confidence interval when the data happen to strongly contradict 

the prior information. Let = a T /3 and r = C T /3 — t, where (3 is the least squares 
estimator of (3. We consider the particular case that that E((t - t)(0 - 9)) = 0, 



so that O and r are independent. We present a new 1 — a confidence interval for 



CO 



X 



9 that utilizes the uncertain prior information that r = 0. The following problem 
is used to illustrate the application of this new confidence interval. Consider a 2 3 
factorial experiment with 1 replicate. Suppose that the parameter of interest 9 is 
a specified linear combination of the main effects. Assume that the three-factor in- 
teraction is zero. Also suppose that we have uncertain prior information that all of 
the two-factor interactions are zero. Our aim is to find a frequentist 0.95 confidence 
interval for 9 that utilizes this uncertain prior information. 
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1. Introduction 

Suppose that the parameter of interest 9 is a scalar and that we have uncertain 
prior information about the parameters of the model. Hodges and Lehmann (1952), 
Bickel (1984) and Kempthorne (1983, 1987, 1988) show how such uncertain prior 
information can be utilized in frequentist inference, mostly for point estimation of 9. 
A confidence interval for 9 is said to be a 1 — a confidence interval if it has infimum 
coverage probability 1 — a. We assess a 1 — a confidence interval J by its scaled 
expected length, defined to be the ratio (expected length of J) / (expected length 
of the standard 1 — a confidence interval for 9). The first requirement of a 1 — a 
confidence interval that utilizes the uncertain prior information is that its scaled 
expected length is significantly less than 1 when the prior information is correct 
(Kabaila, 2009). 

We classify confidence intervals that satisfy this first requirement into the fol- 
lowing two groups. The first group consists of 1 — a confidence intervals with scaled 
expected length that is less than or equal to 1 for all parameter values, so that these 
dominate the standard 1 — a confidence interval. An example of such a confidence 
interval is the Stein-type confidence interval for the normal variance (see e.g. Maata 
and Casella, 1990 and Goutis and Casella, 1991). The second group consists of 1 — a 
confidence intervals that satisfy this first requirement, when dominance of the usual 
1 — a confidence interval is not possible (the scaled expected length must exceed 
1 for some parameter values). Some relevant admissibility results are provided by 
Kabaila, Giri and Leeb (2010) and Kabaila (2011). This second group includes the 
confidence intervals described by Pratt (1961), Brown et al (1995) and Puza and 
O'Neill (2006ab). This second group also includes I — a confidence intervals that sat- 
isfy the additional requirements that (a) the maximum (over the parameter space) 
of the scaled expected length is not too much larger than 1 and (b) the confidence 
interval reverts to the standard 1 — a confidence interval when the data happen to 
strongly contradict the prior information. Confidence intervals that utilize uncertain 
the prior information and satisfy these additional requirements have been proposed 
by Farchione and Kabaila (2008) and Kabaila and Giri (2009ab) (cf Kabaila and 
Giri, 2013). 

Consider the linear regression model Y = X{3 + e , where Y is a random n- 
vector of responses, X is a n x p matrix with linearly independent columns, (3 is an 
unknown parameter p- vector and e ~ iV(0, cr 2 I n ), where a 2 is an unknown positive 
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parameter. Suppose that the parameter of interest is 9 = a T f3, where a is a given p- 
vector (o ^ 0). Let the s-dimensional parameter vector r be defined to be C T (3 — t 
where C is a specified px s matrix (s < p) with linearly independent columns and t is 
a specified s- vector. Suppose that a does not belong to the linear subspace spanned 
by the columns of C. Also suppose that previous experience with similar data sets 
and/or expert opinion and scientific background suggests that r = 0. In other 
words, suppose that we have uncertain prior information that r = 0. Our aim is to 
find a frequentist I — a confidence interval for 9 that utilizes this prior information. 
By "utilizes this prior information" we mean that (a) the scaled expected length of 
this interval is significantly less than 1 when r = 0, (b) the maximum value of the 
scaled expected length is not too large and (c) this confidence interval reverts to the 
standard 1 — a confidence interval when the data happen to strongly contradict the 
prior information. 

Kabaila and Giri (2009a) have dealt with the case that s — 1, so we consider 
the case that s > 2. Let (3 denote the least squares estimator of (3. Also let 
t 2 = (Y - X(3) T (Y - X$)/(n - p), 6 = a T /3 and f = C T (3 - t. We consider 
the particular case that E({t — t)(0 — 9)) = 0. An example of this particular case 
is the following. Consider a 2 3 factorial experiment with 1 replicate. For factorial 
experiments, it is a widely-held belief that the higher the order of interaction, the 
more likely it is to be negligible. Indeed, fractional factorial designs are based on this 
belief. Assume that the third-order interaction is zero. Also suppose that we have 
uncertain prior information that all of the second-order interactions are zero. In this 
case, n — p = 1 and s = 3. If the parameter of interest 9 is a linear combination of 
the main effects then E((t - t)(0 - 9)) = 0. 

In Section 2, we describe the new 1 — a confidence interval for 9 that utilizes the 
uncertain prior information that r = 0. Define 7 = (Cov(r)) r. The coverage 
probability and scaled expected length of this new confidence interval are even func- 
tions of ||7|| = \J 7 T 7- In Section 3, we consider this 2 3 factorial experiment, when 
1 — a — 0.95. Figure 2 presents graphs of the squared scaled expected length and the 
coverage probability of this new confidence interval (as functions of The in- 

fimum coverage probability is computed to be 0.95. To an excellent approximation, 
the coverage probability of the new confidence interval is equal to 0.95, throughout 
the parameter space. This figure demonstrates that this new confidence interval has 
excellent performance in terms of squared scaled expected length. When the prior 
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information is correct (i.e. | | = 0), we gain since the square of the scaled ex- 
pected length is 0.34707, which is much smaller than 1. The maximum value of the 
square of the scaled expected length is 1.0404, which is only slightly larger than I. 
The new 0.95 confidence interval for 9 coincides with the standard 1 — a confidence 
interval when the data strongly contradicts the prior information. This is reflected 
in Figure 2 by the fact that the square of the scaled expected length approaches 
1 as ||7|| — > 00. In Section 4, we examine the effect on the performance of the 
new confidence interval of increasing s, for n — p — 1. The application of the new 
confidence interval is to the case that n — p is small. As pointed out in Section 2, 
the ability of the new confidence interval to utilize the uncertain prior information 
comes from enhanced estimation of a 2 . The smaller n — p is, the larger will be this 
enhancement. 

2. New confidence interval that utilizes the uncertain prior information 

Our first step is to reduce the data to (0,r, E 2 ). Let Vu = a T (X T X)~ l a and 
V22 = C T {X T X)- l C. Note that ~ N(9,a 2 v u ), f ~ N(r,a 2 V 22 ) and E 2 are 
independent random vectors. Let m = n — p. Define the quantile t{m) by the 
requirement that P( — t(m) <T< t(m)) = 1 — a for T ~ t m . The standard 1 — a 
confidence interval for 6 = a T (3 is 

I = [e - t(m)y/v^ E, 6 + t(m)v^7 E] . 

The new confidence interval for 9 that we will describe shortly is centered at 0. 
The fact that and f are independent suggests that the uncertain prior information 
that r = should not influence the point estimation of 6. However, this uncertain 
prior information can be used to enhance the estimation of a 2 . In the absence of any 
prior information about r, the standard estimator of o 2 is E 2 and mE 2 /<j 2 ~ Xm- 
However, if it known that r = then that standard estimator of a 2 is 

~ 2 mt 2 + f T V 22 l f ~ 2 (m + sF s 



m + s \ m + s 

where F = (f T V^ 1 r/s)/E 2 and (m + s)E 2 /cr 2 ~ x 2 m+s - 

This suggests that the 

uncertain prior information that r = can be used to enhance the estimation of a 2 
by using the appropriate function of E 2 and F. 

This motivates us to consider a new confidence interval for 9 of the form 

J(d) = [© - v ^7Erf(v / F), + v ^7Erf(v / F) 
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where the function d : [0, oo) — > (0, oo) is required to satisfy the following restric- 
tions. 

Restriction 1 d is a continuous function. 

Restriction 2 d(x) = t(m) for all x > k, where A; is a specified positive number. 

The first restriction implies that the endpoints of the confidence interval J(d) are 
continuous functions of the data. Note that F is the usual F statistic for testing 
the null hypothesis H : t — against the alternative hypothesis Hi : r ^ 0. 
Thus the second restriction implies that this confidence interval reverts to the usual 
1 — a confidence interval I when the data happen to strongly contradict the prior 
information that r = 0. 

Part of the evaluation of the confidence interval J(d) consists of comparing it 
with the usual 1 — a confidence interval / using the scaled expected length crite- 
rion (expected length of this confidence interval) / (expected length of J). Theorem 
1, which is stated and proved in Appendix A, provides computationally-convenient 
expressions for the coverage probability and scaled expected length of J(d). Define 
7 = (Cov(r)) ^ 2 r = (l/cr)V 22 1 ^ 2 T. According to this theorem, for given function 
d, both the coverage probability and the scaled expected length of J(d) are functions 
of ||7||. We denote this scaled expected length by e(||7||;d). The numerical inte- 
gration method used to evaluate this coverage probability is described in Appendix 
B. 

Our aim is to find a function d satisfying Restrictions 1 and 2 and such that 
(a) the minimum of P(0 G J(d)) over ||7|| is 1 — a and (b) e(0;d) is minimized 
subject to the restriction that e(||7||;d) < £ for all ||7||, where £ > 1 is chosen 
by the statistician prior to the analysis of the data. Theorem 2, which is stated 
and proved in Appendix C, provides a computationally-convenient expression for 
e(0;d). We expect that, for small m, this constrained minimization will lead to a 
1 — a confidence interval for 9 that has scaled expected length e(||7||;d) that is 
substantially less 1 for 7 = 0. 

We implement the coverage constraint P(9 e J(d)) > 1 — a for all ||7|| > 
as follows. For any reasonable choice of the function d, P{9 e J{d)) converges to 
1 — a as ||7|| — > 00. The constraints implemented in the computations are that 
P{9 G J{d)) > 1 — a for every ||7|| in a judiciously-chosen finite set of values Q. 
That a given Q is adequate to the task is judged by checking numerically, at the 



5 



completion of the computations, that the coverage probability constraint is satisfied 
for all ||7|| > (cf. Farchione and Kabaila, 2012). 

For computational feasibility, we specify the following parametric form for the 
function d. Suppose that x±, . . . , x q satisfy = x x < x 2 < ■ ■ ■ < x q = k. We fully 
specify the function d by the vector (d(xi), . . . ,d(x q ^i)) as follows. The value of 
d(x) for any x G [0, k] is specified by natural cubic spline interpolation for these 
given function values and d(x q ) = t(m) (without any endpoint conditions on the 
first derivative of d). We call x 1: x 2 , ■ ■ ■ x q the knots. Of course, the values of k, 
£ and knots Xi need to be judiciously-chosen and this will usually require some 
computational exploration. 

3. Application to the analysis of data from a single-replicate 2 3 factorial 
experiment 

Consider a 2 3 factorial experiment carried out without replication. Let Y denote 
the response and let xi, x 2 and x 3 denote the coded levels for each of the 3 factors, 
where the coded level takes either the value —1 or 1. We assume the model 

Y = (3 + (3iXi + P2X2 + $3X3 + fi\2X\X 2 + /?13^1^3 + /?23^2^3 + /^m^l^S + 6 

where (3q, (3i, (3 2 , /?3, Pi 2 , P13, /?23, fim are unknown parameters and e ~ N(0,a 2 ), 
where a 2 is an unknown positive parameter. 

For factorial experiments it is commonly believed that higher order interactions 
are negligible (see e.g. Mead (1988, p. 368) and Hinkelman & Kempthorne (1994, 
p. 350)). Indeed, this type of belief is the basis for the design of fractional factorial 
experiments. Assume that ^123 = 0. Also suppose that we have uncertain prior 
information that (5i 2) /3i 3 and /3 2 3 are all zero. Thus n — p = 1. We consider the 
particular case that the parameter of interest interest 9 is a linear combination of 
the main effects i.e. 9 = aifti + a 2 (3 2 + 03/33. In this case, E((t — t)(© — #)) = 0. 

Of course, the properties of J(d), resulting from the constrained minimization 
described in Section 2, depend on the values of k, £, the knots Xi and 1 — a. We focus 
on the particular case that k = 15, £ = 1.02, the knots are at 0, 1, 2, 3, 7, 12, 15 and 
1 — a — 0.95. When we compute the new confidence interval, we obtain the function 
d shown in Figure 1. All of the computations presented in the present paper were 
performed with programs written in MATLAB, using the optimization and statistics 
toolboxes. Consistent with the corollary stated in Appendix C, d(x) takes values 
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larger than t(m). Figure 2 presents graphs of the squared scaled expected length 
and the coverage probability (as functions of HtH) of this new confidence interval. 
The squared scaled expected length and coverage probability computations were 
checked using Monte Carlo simulations. 

The infimum coverage probability is computed to be 0.95. The upper panel of 
Figure 2 demonstrates that this new confidence interval has excellent performance in 
terms of squared scaled expected length. When the prior information is correct (i.e. 
||7|| = 0), we gain since the square of the scaled expected length is 0.34707, which 
is much smaller than 1. The maximum value of the square of the scaled expected 
length is 1.0404, which is only slightly larger than 1. The new 0.95 confidence 
interval for 9 coincides with the standard 1 — a confidence interval when the data 
strongly contradicts the prior information. This is reflected in the upper panel of 
Figure 2 by the fact that the square of the scaled expected length approaches 1 as 

I |7| I — 7- OO. 

4. The effect on the performance of the new confidence interval of in- 
creasing the value of s 

Suppose that the value of m is fixed and that we increase s. It seems plausible 
that the best possible performance of the new confidence interval for 6 will increase 
as s increases i.e. as the amount of uncertain prior information increases. We have 
examined the truth of this plausible result as follows. We have considered m = 1 
and 1 — a = 0.95, chosen £ = 1.02 and the number of knots to be 7. For each 
s = 1,2,3,5 and 7, we have chosen k and the knots so as to minimize the scaled 
expected length at ||7|| = 0. We have obtained the following results: 



s 


Min sq sel 


Max sq sel 


Min CP 


Max CP 


1 


0.80549 


1.0414 


0.95 


0.95049 


2 


0.54698 


1.0404 


0.95 


0.95030 


3 


0.34707 


1.0404 


0.95 


0.95037 


5 


0.25151 


1.0406 


0.95 


0.95034 


7 


0.19027 


1.0404 


0.95 


0.95030 



Table 1: Comparison of results for different values of s, for m — 1, 1 — a = 0.95 and 
£ = 1.02. The column labels Min sq sel, Max sq sel, Min CP and Max CP denote 
the minimum squared scaled expected length, the maximum squared scaled expected 
length, the minimum coverage probability and the maximum coverage probability, 
respectively. 
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Figure 1: Plot of the function d for the new 0.95 confidence interval for 6 when s = 
m = n — p = 1 and t = 1.02. The knots are at 0, 1, 2, 3, 7, 12, 15. 
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Figure 2: Plots of squared scaled expected length e 2 (||7||;d) and coverage probability 
function of ||7|| when 1 — a = 0.95, s = 3, m = n — p = 1 and I = 1.02. 
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If, for each s considered, we assume that the performance of the confidence interval 
in terms of the scaled expected length at | | =0 is about as good as it can be 
then this table tells us the following. As s increases, the amount of uncertain prior 
information increases and this leads to an improvement in the performance of this 
confidence interval. 

5. Discussion 

In this paper we have shown how to construct a frequentist confidence interval 
for the parameter of interest 9 that utilizes the uncertain prior information that 
r = 0. We have done this for the particular case that the covariances between the 
components of the least squares estimator of r and the least squares estimator of 
are all zero. Our practical experience with the computations of this new confidence 
interval, in a variety of circumstances, shows that the coverage probability needs 
to be computed with great accuracy for these computations to be successful. If we 
no longer restrict attention to the particular case that all of these covariances are 
zero then the construction of such a confidence interval necessitates the computation 
of coverage probabilities using more complicated methods of the type employed by 
Kabaila and Farchione (2012). However, the increased computation time of these 
methods would appear to make the constrained optimization not computationally 
practicable. 

Appendix A. Theorem 1 and its proof 

In this appendix, we state and prove Theorem 1, which provides new computationally- 
convenient expressions for the coverage probability and scaled expected length of the 
confidence interval J{d). Define G = (0 — 9)/{<j^/¥[[) and Q = (1/<t 2 )t t V^2 1 t. Also 
define V = \[QJs and W = S/cr. Now, (G, V) and W are independent random 
vectors. The assumption that E((t — t)(0 — 9)) = implies that G and V are 
independent random variables. Thus, G, V and W are independent random vari- 
ables. Note that G ~ A(0, 1), Q has a noncentral x 2 distribution with s degrees of 
freedom and noncentrality parameter ||7|| 2 = 7 T 7 and W has the same distribution 
as y/Xm/ m - 

Theorem 1. Let fy{ ■ ; | |) and fw denote the probability density functions of V 
and W , respectively. Also let $ denote the N(0, 1) distribution function. 
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(a) The coverage probability of J(d) is equal to 

fk POO 



rK poo 

l-a + 2 / ($(wd(x)) - $(wt(m))) fv{xw; w f w (w) dwdx. (1) 
Jo Jo 

For given function d, the coverage probability of J(d) is a function of 
(b) The scaled expected length of J(d) is equal to 

Y poo rk 

1 + t(m)E(W) I Jo ( d ( x )- t ( m ))fv(^\h\\)dxw 2 f w (w)dw. (2) 
For given function d, the scaled expected length of J{d) is a function of \ 

Proof of part (a). It is straightforward to show that the coverage probability 
P{6 e J{d)) is equal to P( - W d(V/W) <G< Wd(V/W)). By the law of total 
probability, this is equal to 

P(-Wd(V/W) <G< Wd(V/W),V/W < k)+P(-t(m)W <G< t(m)W, V/W > 
since d(x) = t(m) for all x > k . Now 

P( - t(m)W <G< t(m)W, V/W >k)+P(- t(m)W <G< t(m)W, V/W < k) 
= P(- t(m)W <G< t(m)W) 
= P(- t(m) < G/W < t(m) ) = 1 - a 

Thus P(6 e J(d)) is equal to 

l-a + P(-Wd(V/W) < G <W d(V/W), V/W < k) 
-P(- t(m)W <G< t(m)W, V/W < k) 

poo pkw 

— l-a+ / (2$ (wd(v/w)) - 1 )f v (v)dv f w (w)dw 
Jo Jo 

(2Q(wt(m)) -l)f v (y)dvfw(w)dw 

POO pKW 

= l-a + 2 / ($ (wd(v/w)) - $ (wt(m)) )f v (v)dv f w (w)dw. 
Jo Jo 

Changing the variable of integration of the inner integral to x — v/w, we obtain 

poo rk 

l-a + 2 / ($ (wd(x)) - $ (wt(m)) )f v (xw; dx wf w (w)dw 
Jo Jo 

rk roo 

= l-a + 2 / ($(wd(x))-$(wt(m)))f v (xw]\\j\\)wfw(w)dwdx 
Jo Jo 



o Jo 

oo pkw 



JO 

oo rkw 
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Proof of part (b). It is straightforward to show that the scaled expected length 
of J(d) is equal to 

E(Wd(V/W)) 
t(m)E(W) ' [) 

We use the notation 

I 1 if A is true 
1(A) = < 

10 if A is false 

where A is an arbitrary statement. Since I(V/W < k) + T(V/W > k) = 1, 
E(Wd(V/W)) is equal to 

E ( Wd(V/W) X(V/W <k))+E{ Wd(Q/W 2 ) X(V/W > k) ) 
= E ( Wd(V/W) 1(V/W < k)) + E(Wt(m) 1(V/W > k) ) 
= t(m)E(W) + E( (d(V/W) - t(m)) W1(V/W < k) ). 
Thus the expression (j3J) for the scaled expected length is equal to 

1 + t{m) l E{w) E ( (d(V/W) - t{m)) WX(V/W < k) ) (4) 



1 

1 + 



f'OQ pkw 

t(m)E(W)J J V\w, 
Changing the variable of integration of the inner integral to x — v/w, we obtain 



d (^j -t(m)j fv{v, H7II) dvw f w (w) dw. (5) 



Appendix B. The method used to evaluate the coverage probability 

In this appendix we describe the numerical integration method used to compute 
the coverage probability P(9 G J(d)), as given by ([T]). The function d, which is a 
cubic spline in the interval [0,k], does not necessarily have a third derivative at each 
of the knots x 2 , ■ ■ ■ ,x q = k. So we evaluate (jTJ) by computing 

9~1 rXi+x poo 

+ / ($(wd(w))-$(wt(m)))f v (xw;\\7\\)wf w (w)dwdx. (6) 

i=1 Jxi JO 

Each of the inner integrals is equal to 

(&(wd(w)) — $(wt(m)))fv(xw; \ I7I |) w fw(w) dw 

= E(($(Wd(x)) - $(Wt(m))) f v (xW; \\^\\) w\ (7) 
where W has pdf fw Let Z = mW 2 , so that Z ~ Xm- Thus d7j) is equal to 



Tn In v \ V in / \ y m I \ y m 



$[J — d(x)) [J — t(m) \ )fv[x\—;\\l\\)y/zf m (z)dz 
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where f m denotes the xL pdf- It can be shown that 



,1/2 f (z) 2^r((m + l)/2) 

2 /mU) " rv/2) /m+l( } - 



Thus (jHJ) is equal to 



(9) 

Assuming that > for all x G [0, fc], we compute this as follows. Let F m denote 
the Xm c df- Now change the variable of integration to u — F m+ i(z), so that (j5J) is 
equal to 

Tr((m+l)/2) f 1 

g(u;x,\\^\\)du 



m T(m/2) J 
where g(u;x, ||7||) is defined to be 



.|^*))-.(^H) J vW^IM 

'(10) 

for all (x, u) G [0, fc] x [0, 1) and the limit of f lTUj) as u approaches 1 from below for 
u = 1 and all x G [0, k}. Thus gr(l; x, ||'y||) = for all x G [0, k}. 

Appendix C. Theorem 2 and its proof 

The following theorem provides a computationally- convenient expression for the 
criterion e(0; d). 

Theorem 2. The criterion e(0; d) zs egna/ to 

2 3/2 s «/2r(( s + m + l)/2) f,, , , ^ , i ™ m/2 , , s 

1+ . . 7'' { / (d(x) - t(mj) X s - 1 r— — 7777^ 11 

*(m)S(W r ) r(m/2) T(s/2) Jo ' (sx 2 + m )^+m+i)/2 v v 

where Y denotes the gamma function. 

Proof. The proof of this theorem uses Theorem 1 (b). It follows from (TS]) that 



e(0; d) - 1 = 77— — — / / (d(x) - t(m)) f v (wx;0) dxw 2 f w (w) dw 
t{m)E{W) J J 

Note that fv(v;0) = 2svf s (sv 2 ), where f s denotes \ 2 S probability density function. 
Interchanging the order of integration, we obtain 



2s 



h 



e(0;d) - 1 = t ( m ^ E ( W ^ J (d(x)-t(m))x j f s ((sx 2 )w 2 )w 3 f w (w)dwdx. 



o 
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Lemma 1. For each y > 0, 



oc 



2 : , , ^ , 2 1 / 2 m m / 2 y^ 2 )- 1 r((s + m + l)/2) 



o /»^^)^ = (, + m) (^)/ 2 r(m/2)r( g /2) • (12) 

Proof. Note that /W( w ) — 2mw f m (mw 2 ) , where / m denotes the xL probability 
density function. Substituting the expressions for f s and fw, we obtain 

By (A2.1.3) of Box and Tiao (1973), this is equal to the right-hand side of (fl2i) . 

□ 

It follows from this lemma that 

- 2 3/2 s s / 2 r((s + m + l)/2) [ k (M ,_, ^ s -i ™ m/2 
e(0;d)-l- t(m) £ Wr(m/2)r(s/2) y o (sx 2 + m)(s+m+ i)/ 2 dx - 

□ 

Appendix D. Some simple results on confidence interval performance 

In this appendix we consider the confidence interval 

j[d) = [e - v ^TSrf(v / F), e + Vuirsd(VF) 



where ti : [0, oo) — >■ (0,oo). We make additional requirements of d, as needed. We 
state some simple results about the performance of this confidence interval. The 
proofs of these results are straightforward and are omitted, for the sake of brevity. 

Theorems 3 and 4 concern the expected length of J(d). Theorem 3 is used in 
the proof of Theorem 4. 

Theorem 3. Suppose that di{x) > d 2 (x) for all x > 0. Then 
E {length of J(rfi)) > E {length of J(d 2 )) for all 



Theorem 4. Suppose that d\{x) > d 2 (x) for all x > and that there exists e > 
and an interval [a, b] (where < a < b) such that d%{x) > d 2 (x) + e for all x G [a, b]. 
Then 

E {length of J {di}) > E {length of J \d 2 )) for all 



13 



Theorems 5 and 6 concern the coverage probability of J(d). Theorem 5 is used 
in the proof of Theorem 6. 

Theorem 5. Suppose that di(x) > d 2 (x) for all x > 0. Then 



Theorem 6. Suppose that d\(x) > d 2 (x) for all x > and that there exists e > 
and an interval [a, b] (where < a < b) such that d±(x) > d 2 (x) + e for all x e [a, b]. 



These theorems have the following three consequences. 

Corollary. Suppose that d is continuous. If d(0) < t(m) and d(x) < t{m) for all 
x > then 
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